Abstract. These notes cover background material on trees which are used in 
the paper [1]. 



1. Trees and paths - background information 

In the paper [1] it is shown that trees have an important role as the negligible sets 
of control theory, quite analogous to the null sets of Lebesgue integration. The trees 
considered are analytic objects in flavour, and not the finite combinatorial objects 
of undergraduate courses. In this note we collect together a few related ways of 
looking at them, and prove a basic characterisation generalising the concept of 
height function. 

We first recall that 

(1). Graphs $(E, V)$ that are acyclic and connected are generally called trees. If 
such a tree is non-empty and has a distinguished vertex $v$ it is called a rooted tree. 

(2). A rooted tree induces and is characterised by a partial order on $V$ with least 
element $v$. The partial order is defined as follows: 

$a \le b$ if the circuit-free path from the root $v$ to $b$ goes through $a$. 

This order has the property that for each fixed $b$ the set $\{a : a \le b\}$ is totally ordered 
by $\le$. 

Conversely any partial order on a finite set $V$ with a least element $v$ and the 
property that for each $b$ the set $\{a : a \le b\}$ is totally ordered defines a unique rooted 
tree on $V$. One of the simplest ways to construct a tree is to consider a (finite) 
collection $\Omega$ of paths in a graph sharing a fixed initial or starting vertex, and with 
the partial order that $\omega \le \omega'$ iff $\omega$ is an initial segment of $\omega'$. 

(3). Alternatively, let $(E, V)$ be a graph extended into a continuum by assigning a 
length to each edge. Let $d(a, b)$ be the infimum of the lengths of paths between 
the two vertices $a$, $b$ in the graph. Then $d$ is a geodesic metric on $V$. Trees are 
exactly the graphs that give rise to 0-hyperbolic metrics in the sense of Gromov (see 
for example [2]). 

(4). There are many ways to enumerate the edges and nodes of a finite rooted tree. 
One way is to think of a family tree recording the descendants of a single individual 
(the root). Start with the root. At the root, if all children have been visited, stop; 
at any other node, if all the children have been visited, move up to the parent. If 
there are children who have not been visited, then visit the oldest unvisited child. 
At each time $n$ the enumeration either moves up an edge or down an edge, and each 
edge is visited exactly twice. Let $h(n)$ denote the distance from the top of the 
family tree after $n$ steps in this enumeration, with the convention that $h(0) = 0$. 
Then $h$ is similar to the path of a random walk, moving up or down one unit at each 
step, except that it is non-negative and returns to zero exactly as many times as there 
are edges coming from the root. Hence $h(2|E|) = 0$. 

The function $h$ completely describes the rooted tree, and it directly yields 
the nearest neighbour metric on the tree. If $h$ is a function such that $h(0) = 0$, 
it moves up or down one unit at each step, is non-negative and $h(2|E|) = 0$, then $d$ 
defined by 

$$d(m, n) = h(m) + h(n) - 2 \inf_{u \in [m,n]} h(u)$$

is a pseudo-metric on $[0, 2|E|]$. If we identify points in $[0, 2|E|]$ that are zero 
distance apart and join by edges the equivalence classes of points that are distance 
one apart, then one recovers an equivalent rooted tree. 

Put less pedantically, let the enumeration be at $a$ at step $n$ and at $b$ at step $m$ and 
define 

$$d(a, b) = h(m) + h(n) - 2 \inf_{u \in [m,n]} h(u);$$

then it is simple to check that $d$ is well defined and is a metric on vertices making 
the set of vertices a tree. 
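
The coding in (4) can be tried out on a small example. The following Python sketch is an illustration only and is not taken from [1]; the child-list representation of the tree and the names `contour_function`, `tree_distance` and `graph_distance` are assumptions made here. It walks a rooted tree in the order described above, records $h$, and compares the distance formula with the usual graph distance.

```python
from collections import deque

def contour_function(children, root):
    """Walk the tree as in (4). Returns (h, visits): h[n] is the depth after
    n steps, visits[n] is the vertex occupied at step n."""
    h, visits = [0], [root]
    stack = [(root, iter(children.get(root, [])))]
    while stack:
        node, remaining = stack[-1]
        child = next(remaining, None)
        if child is None:
            # All children visited: move up to the parent (or stop at the root).
            stack.pop()
            if stack:
                h.append(h[-1] - 1)
                visits.append(stack[-1][0])
        else:
            # Visit the oldest unvisited child.
            stack.append((child, iter(children.get(child, []))))
            h.append(h[-1] + 1)
            visits.append(child)
    return h, visits

def tree_distance(h, m, n):
    """d(m, n) = h(m) + h(n) - 2 * min of h over [m, n]."""
    lo, hi = min(m, n), max(m, n)
    return h[m] + h[n] - 2 * min(h[lo:hi + 1])

def graph_distance(children, a, b):
    """Number of edges between a and b, by breadth-first search."""
    adj = {}
    for parent, kids in children.items():
        for kid in kids:
            adj.setdefault(parent, []).append(kid)
            adj.setdefault(kid, []).append(parent)
    dist = {a: 0}
    queue = deque([a])
    while queue:
        u = queue.popleft()
        for w in adj.get(u, []):
            if w not in dist:
                dist[w] = dist[u] + 1
                queue.append(w)
    return dist[b]

# A hypothetical tree with 5 edges; children are listed oldest first.
children = {"r": ["a", "b"], "a": ["c", "d"], "b": ["e"]}
h, visits = contour_function(children, "r")
print(h)
```

In this example the walk has $2|E| = 10$ steps and returns to zero twice, once for each edge at the root. Two steps that sit at the same vertex are at distance zero, which is why $d$ is only a pseudo-metric on the set of times.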

Thus excursions of simple (random) walks are a convenient (and well studied) 
way to describe abstract graphical trees. This particular choice for coding a tree 
with a positive function on the interval can be extended to describe continuous 
trees. This approach was used by Le Gall [3] in his development of the Brownian 
snake associated to the measure-valued Dawson-Watanabe process. 

2. R-TREES ARE CODED BY CONTINUOUS FUNCTIONS 

One of the early examples of a continuous tree is the evolution of a continuous 
time stochastic process, where, as is customary in probability theory, one identifies 
the evolution of two trajectories until the first time they separate. (This idea dates 
back at least to Kolmogorov and his introduction of filtrations.) Another popular 
and equivalent approach to continuous trees is through R-trees ([4], p. 425 and the 
references there). 

Interestingly, analysts and probabilists have generally rejected the abstract tree 
as too wild an object, and usually add extra structure, essentially a second topology 
or Borel structure on the tree that comes from thinking of the tree as a family of 
paths in a space which also has some topology. This approach is critical to the 
arguments used in [1], where tree-like paths are approximated by simpler tree-like 
paths in 1-variation. (They would never converge in the 'hyperbolic' metric.) 
In contrast, group theorists and low dimensional topologists have made a great deal 
of progress by studying specific symmetry groups of these trees and do not seem to 
find their hugeness too problematic. 

Our goal in this subsection of the appendix is to prove the simple representation: 
that the general R-tree arises from identifying the contours of a continuous function 
on a locally connected and connected space. The height functions we considered 
on [0, T] are a special case. 

Definition 2.1. An R-tree is a uniquely arcwise connected metric space, in which 
the arc between two points is isometric to an interval. 

Such a space is locally connected, for let $B_x^n$ be the set of points a distance at 
most $1/n$ from $x$. If $z \in B_x^n$, then the arc connecting $x$ with $z$ is isometrically 
embedded, and hence is contained in $B_x^n$. Hence $B_x^n$ is the union of connected sets 
with non-empty common intersection (they contain $x$) and is connected. The sets 
$B_x^n$ form a basis for the topology induced by the metric. Observe that if two arcs 
meet at two points, then the uniqueness assertion ensures that they coincide on the 
interval in between. 

Fix some point $v$ as the 'root' and let $x$ and $y$ be two points in the R-tree. 
The arcs from $x$ and $y$ to $v$ have a maximal interval in common starting at $v$ and 
terminating at some $v_1$; after that time they never meet again. One arc between 
them is the join of the arcs from $x$ to $v_1$ and from $v_1$ to $y$ (and hence it is the arc and a geodesic 
between them). Hence 

$$d(x, y) = d(x, v) + d(y, v) - 2\, d(v, v_1).$$
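
The identity above can be spelled out by splitting each arc at the branch point where the arcs from $x$ and $y$ to the root separate (written $v_1$ here). The only fact used is that arcs are isometric to intervals, so distances add along an arc:

```latex
\begin{align*}
d(x, v) &= d(x, v_1) + d(v_1, v), \\
d(y, v) &= d(y, v_1) + d(v_1, v), \\
d(x, y) &= d(x, v_1) + d(v_1, y) \\
        &= \bigl(d(x, v) - d(v_1, v)\bigr) + \bigl(d(y, v) - d(v_1, v)\bigr) \\
        &= d(x, v) + d(y, v) - 2\, d(v, v_1).
\end{align*}
```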

Example 2.2. Consider the space $\Omega$ of continuous paths $X_t \in E$ where each path 
$\omega$ is defined on an interval $[0, \xi(\omega))$ and has a left limit at $\xi(\omega)$. Suppose that if 
$X \in \Omega$ is defined on $[0, \xi)$, then $X|_{[0,s)} \in \Omega$ for every $s$ less than $\xi$. Define 

$$d(\omega, \omega') = \xi(\omega) + \xi(\omega') - 2 \sup\{t < \min(\xi(\omega), \xi(\omega')) \mid \omega(s) = \omega'(s) \ \forall s \le t\}.$$

Then $(\Omega, d)$ is an R-tree. 
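
A discrete analogue of this example may help fix ideas. In the sketch below (an illustration under assumptions made here, not the construction of the example itself) paths are finite tuples indexed by integer times, the lifetime of a path is its length, and the family is closed under taking initial segments. The names `path_distance`, `prefix_closure` and `satisfies_four_point` are hypothetical. The last function checks the four-point form of Gromov's 0-hyperbolicity condition mentioned in (3).

```python
from itertools import product

def common_prefix_length(w1, w2):
    """Length of the longest initial segment shared by w1 and w2."""
    n = 0
    for a, b in zip(w1, w2):
        if a != b:
            break
        n += 1
    return n

def path_distance(w1, w2):
    """Lifetime of w1 plus lifetime of w2 minus twice the time they agree."""
    return len(w1) + len(w2) - 2 * common_prefix_length(w1, w2)

def prefix_closure(paths):
    """All initial segments of the given paths, including the empty path."""
    return {tuple(p[:k]) for p in paths for k in range(len(p) + 1)}

def satisfies_four_point(points, dist):
    """Four-point condition: d(x,y) + d(z,w) is at most the larger of
    d(x,z) + d(y,w) and d(x,w) + d(y,z), for all x, y, z, w."""
    for x, y, z, w in product(points, repeat=4):
        s1 = dist(x, y) + dist(z, w)
        s2 = dist(x, z) + dist(y, w)
        s3 = dist(x, w) + dist(y, z)
        if s1 > max(s2, s3):
            return False
    return True

# Hypothetical family of discrete paths, closed under restriction.
omega = prefix_closure([(0, 1, 1), (0, 1, 0, 0), (1, 1)])
print(len(omega), path_distance((0, 1, 1), (0, 1, 0, 0)))
print(satisfies_four_point(omega, path_distance))
```

The empty path plays the role of the root, and two paths are joined through their longest common initial segment.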

We now give a way of constructing R-trees. The basic idea for this is quite easy, 
but the core of the argument lies in the detail so we proceed carefully in stages. 

Let $I$ be a connected and locally connected topological space, and $h : I \to \mathbb{R}$ be 
a positive continuous function that attains its lower bound at a point $v \in I$. 

Definition 2.3. For each $x \in I$ and $\lambda \le h(x)$ define $C_{x,\lambda}$ to be the maximal 
connected subset of $\{y \mid h(y) \ge \lambda\}$ containing $x$. 

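Lemma 2.4. The sets $C_{x,\lambda}$ exist, and are closed. Moreover, if $C_{x,\lambda} \cap C_{x',\lambda'} \ne \emptyset$ 
and $\lambda \le \lambda'$, then 

$$C_{x',\lambda'} \subset C_{x,\lambda}.$$

Proof. An arbitrary union of connected sets with non-empty intersection is connected, 
so taking the union of all connected subsets of $\{y \mid h(y) \ge \lambda\}$ containing $x$ 
constructs the unique maximal connected subset. Since $h$ is continuous the closure 
$D_{x,\lambda}$ of $C_{x,\lambda}$ is also a subset of $\{y \mid h(y) \ge \lambda\}$. The closure of a connected set is 
always connected, hence $D_{x,\lambda}$ is also connected. It follows from the fact that $C_{x,\lambda}$ 
is maximal that $C_{x,\lambda} = D_{x,\lambda}$ and so is a closed set. 

If $C_{x,\lambda} \cap C_{x',\lambda'} \ne \emptyset$ and $\lambda \le \lambda'$, then 

$$x \in C_{x,\lambda} \cup C_{x',\lambda'} \subset \{y \mid h(y) \ge \lambda\},$$

and since $C_{x,\lambda} \cap C_{x',\lambda'} \ne \emptyset$ the set $C_{x,\lambda} \cup C_{x',\lambda'}$ is connected. Hence maximality 
ensures $C_{x,\lambda} = C_{x,\lambda} \cup C_{x',\lambda'}$ and hence $C_{x',\lambda'} \subset C_{x,\lambda}$. □ 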

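Corollary 2.5. Either $C_{x,\lambda}$ equals $C_{x',\lambda}$ or it is disjoint from it. 

Proof. If they are not disjoint, then the previous Lemma can be applied twice to 
prove that $C_{x',\lambda} \subset C_{x,\lambda}$ and $C_{x,\lambda} \subset C_{x',\lambda}$. □ 

Corollary 2.6. If $C_{x,\lambda} = C_{x',\lambda}$, then $C_{x,\lambda''} = C_{x',\lambda''}$ for all $\lambda'' < \lambda$. 

Proof. The sets $C_{x,\lambda}$, $C_{x',\lambda}$ are nonempty and have nontrivial intersection. Since 
$C_{x,\lambda} \subset C_{x,\lambda''}$ and $C_{x',\lambda} \subset C_{x',\lambda''}$, the sets $C_{x,\lambda''}$ and $C_{x',\lambda''}$ have nontrivial intersection. Hence 
they are equal. □ 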

Corollary 2.7. $y \in C_{x,\lambda}$ if and only if $C_{y,h(y)} \subset C_{x,\lambda}$. 

Proof. Suppose that $y \in C_{x,\lambda}$; then $C_{y,h(y)}$ and $C_{x,\lambda}$ are not disjoint. It follows from 
the definition of $C_{x,\lambda}$ and $y \in C_{x,\lambda}$ that $h(y) \ge \lambda$. By Lemma 2.4, $C_{y,h(y)} \subset C_{x,\lambda}$. 
Conversely, suppose that $C_{y,h(y)} \subset C_{x,\lambda}$; since $y \in C_{y,h(y)}$ it is obvious that $y \in C_{x,\lambda}$. □ 

Definition 2.8. The set $C_x := C_{x,h(x)}$ is commonly referred to as the contour of 
$h$ through $x$. 
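
To see the sets of Definition 2.3 and the nesting of Lemma 2.4 in a concrete case, one can replace the space by a finite connected graph and take components of superlevel sets. This is only a discrete stand-in for the topological setting of these notes; the graph, the heights and the names `superlevel_component` and `equal_or_disjoint` are assumptions made for the sketch.

```python
from collections import deque

def superlevel_component(adj, h, x, level):
    """Connected component of x in the subgraph on {y : h[y] >= level}.
    Requires level <= h[x], as in Definition 2.3."""
    if h[x] < level:
        raise ValueError("need level <= h[x]")
    seen = {x}
    queue = deque([x])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in seen and h[w] >= level:
                seen.add(w)
                queue.append(w)
    return frozenset(seen)

def equal_or_disjoint(adj, h, level):
    """Discrete check of Corollary 2.5 at a fixed level."""
    comps = [superlevel_component(adj, h, x, level) for x in adj if h[x] >= level]
    return all(a == b or not (a & b) for a in comps for b in comps)

# Hypothetical example: the path graph 0-1-2-3-4-5-6 with these heights.
h = [1, 3, 2, 4, 1, 3, 2]
adj = {i: [j for j in (i - 1, i + 1) if 0 <= j < len(h)] for i in range(len(h))}

print(sorted(superlevel_component(adj, h, 1, 2)))     # component of 1 at level 2
print(sorted(superlevel_component(adj, h, 3, h[3])))  # discrete contour through 3
```

Here the component of vertex 3 at level 3 sits inside the component of vertex 1 at level 2, as Lemma 2.4 predicts, while the component of vertex 5 at level 2 is disjoint from both.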

The map $x \mapsto C_x$ induces a partial order on $I$ with $x \preceq y$ if $C_x \supseteq C_y$. If $h$ 
attains its lower bound at $x$, then $C_x = I$ since $\{y \mid h(y) \ge h(x)\} = I$ and $I$ is 
connected by hypothesis. Hence the root $v \preceq y$ for all $y \in I$. 

Lemma 2.9. Suppose that $\lambda \in [h(v), h(x)]$; then there is a $y$ in $C_{x,\lambda}$ such that 
$h(y) = \lambda$ and, in particular, there is always a contour ($C_{x,\lambda}$) at height $\lambda$ through $y$ 
that contains $x$. 

Proof. By definition, $C_{x,\lambda}$ is the maximal connected subset of $\{h \ge \lambda\}$ 
containing $x$. Suppose, for a contradiction, that there is no $y$ in $C_{x,\lambda}$ with $h(y) = \lambda$, so 
that $C_{x,\lambda}$ is contained in $\{h > \lambda\}$; hence $C_{x,\lambda}$ is a maximal connected subset of $\{h > \lambda\}$. 
Now $\{h > \lambda\}$ is open and locally connected, hence its maximal connected subsets 
are open and $C_{x,\lambda}$ is open. However it is also closed, it is non-empty (it contains $x$) 
and it omits $v$ (since $h(v) \le \lambda$), which contradicts the 
connectedness of $I$. Thus we have established the existence of the point $y$. □ 

The contour is obviously unique, although $y$ is in general not. If we consider the 
equivalence classes $x \sim y$ if $x \preceq y$ and $y \preceq x$, then we see that the equivalence classes 
$[y]_\sim$ of $y \preceq x$ are totally ordered and in one to one correspondence with points in 
the interval $[h(v), h(x)]$. 

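Lemma 2.10. If $z \in C_{y,\lambda}$ and $h(z) > \lambda$, then $z$ is in the interior of $C_{y,\lambda}$. If 
$C_{x',\lambda'} \subset C_{x,\lambda}$ with $\lambda' > \lambda$, then $C_{x,\lambda}$ is a neighbourhood of $C_{x',\lambda'}$. 

Proof. $I$ is locally connected, and $h$ is continuous, hence there is a connected 
neighbourhood $U$ of $z$ such that $h > \lambda$ on $U$. By maximality $U \subset C_{z,\lambda}$. Since 
$C_{z,\lambda} \cap C_{y,\lambda} \ne \emptyset$ we have $C_{z,\lambda} = C_{y,\lambda}$ and thus $U \subset C_{y,\lambda}$. Hence $C_{y,\lambda}$ is a 
neighbourhood of $z$. The last part follows by noting that for all $z \in C_{x',\lambda'}$ 
we have $h(z) \ge \lambda' > \lambda$ and hence, by the first part, $C_{x,\lambda}$ is a neighbourhood of $z$. □ 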

We now define a pseudo-metric on $I$. Lemma 2.10 (the only place we will use 
local connectedness) is critical to showing that the map from $I$ to the resulting 
quotient space is continuous. 

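Definition 2.11. If $y$ and $z$ are points in $I$, define $\lambda(y, z) \le \min(h(y), h(z))$ to be 
the supremum of the levels $\lambda$ such that $C_{y,\lambda} = C_{z,\lambda}$: 

$$\lambda(y, z) = \sup\{\lambda \mid C_{y,\lambda} = C_{z,\lambda},\ \lambda \le h(y),\ \lambda \le h(z)\}.$$

The set 

$$\{\lambda \mid C_{y,\lambda} = C_{z,\lambda},\ \lambda \le h(y),\ \lambda \le h(z)\}$$

is a non-empty interval $[h(v), \lambda(y, z)]$ or $[h(v), \lambda(y, z))$, where $\lambda(y, z)$ satisfies 

$$h(v) \le \lambda(y, z) \le \min(h(y), h(z)).$$

Clearly $\lambda(x, x) = h(x)$. 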

Lemma 2.12. The function $\lambda$ is lower semi-continuous: 

$$\liminf_{z \to z_0} \lambda(y, z) \ge \lambda(y, z_0).$$

Proof. Fix $y$, $z_0$ and choose some $\lambda' < \lambda(y, z_0)$. By the definition of $\lambda(y, z_0)$ we 
have that $C_{y,\lambda'} = C_{z_0,\lambda'}$. Since $h(z_0) > \lambda'$, Lemma 2.10 gives a neighbourhood $U$ of $z_0$ so 
that $U \subset C_{z_0,\lambda'}$. For any $z \in U$ one has $z \in C_{z,\lambda'} \cap C_{z_0,\lambda'}$. Hence $C_{z_0,\lambda'} = C_{z,\lambda'}$ 
and $C_{y,\lambda'} = C_{z,\lambda'}$. Thus $\lambda(y, z) \ge \lambda'$ for $z \in U$ and hence 

$$\liminf_{z \to z_0} \lambda(y, z) \ge \lambda'.$$

Since $\lambda' < \lambda(y, z_0)$ was arbitrary, 

$$\liminf_{z \to z_0} \lambda(y, z) \ge \lambda(y, z_0)$$

and the result is proved. □ 

Lemma 2.13. The following inequality holds: 

$$\min\{\lambda(x, z), \lambda(y, z)\} \le \lambda(x, y).$$

Proof. If $\min\{\lambda(x, z), \lambda(y, z)\} = h(v)$, then there is nothing to prove. Recall that 

$$\{\lambda \mid C_{y,\lambda} = C_{x,\lambda},\ \lambda \le h(y),\ \lambda \le h(x)\}$$

is connected and contains $h(v)$. Suppose $h(v) \le \lambda < \min\{\lambda(x, z), \lambda(y, z)\}$; then 
it follows that the identity $C_{x,\lambda} = C_{z,\lambda}$ holds for $\lambda$. Similarly $C_{y,\lambda} = C_{z,\lambda}$. As a 
result $C_{x,\lambda} = C_{y,\lambda}$ and $\lambda(x, y) \ge \lambda$. □ 

Definition 2.14. Define $d$ on $I \times I$ by 

$$d(x, y) = h(x) + h(y) - 2\lambda(x, y).$$

Lemma 2.15. The function $d$ is a pseudo-metric on $I$. If $(\tilde{I}, d)$ is the resulting 
quotient metric space, then the projection $I \to \tilde{I}$ from the topological space $I$ to the 
metric space is continuous. 

Proof. Clearly $d$ is positive and symmetric, and we have remarked that $\lambda(x, x) = h(x)$ for all $x$, 
hence it is zero on the diagonal. To see the triangle inequality, assume (by the symmetry 
between $x$ and $y$) that 

$$\lambda(x, z) = \min\{\lambda(x, z), \lambda(y, z)\},$$

so that $\lambda(x, z) \le \lambda(x, y)$ by Lemma 2.13, and then observe 

$$\begin{aligned}
d(x, y) &= h(x) + h(y) - 2\lambda(x, y) \\
        &\le h(x) + h(y) - 2\lambda(x, z) \\
        &= h(x) + h(z) - 2\lambda(x, z) + h(y) - h(z) \\
        &\le d(x, z) + |h(y) - h(z)|,
\end{aligned}$$

but $\lambda(y, z) \le \min(h(y), h(z))$ and hence 

$$\begin{aligned}
|h(y) - h(z)| &= h(y) + h(z) - 2\min(h(y), h(z)) \\
              &\le h(y) + h(z) - 2\lambda(y, z) \\
              &= d(y, z),
\end{aligned}$$

hence 

$$d(x, y) \le d(x, z) + d(y, z)$$

as required. □ 
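
The quantities of Definitions 2.11 and 2.14 can be computed by brute force in a discrete stand-in for the setting above: a finite connected graph with a height on each vertex, where superlevel components replace the sets of Definition 2.3. This is a sketch under those assumptions only (the ring graph, the heights and the names `component`, `lam` and `d` are hypothetical); it checks the inequality of Lemma 2.13 and the triangle inequality on one example and proves nothing about the continuous case.

```python
from collections import deque

def component(adj, h, x, level):
    """Connected component of x in the subgraph on {y : h[y] >= level}."""
    seen, queue = {x}, deque([x])
    while queue:
        u = queue.popleft()
        for w in adj[u]:
            if w not in seen and h[w] >= level:
                seen.add(w)
                queue.append(w)
    return seen

def lam(adj, h, y, z):
    """Largest level at which y and z share a superlevel component.
    On a finite graph it is enough to test the values taken by h."""
    top = min(h[y], h[z])
    for level in sorted({v for v in h if v <= top}, reverse=True):
        if z in component(adj, h, y, level):
            return level
    raise ValueError("graph is not connected")

def d(adj, h, x, y):
    """d(x, y) = h(x) + h(y) - 2 * lam(x, y), as in Definition 2.14."""
    return h[x] + h[y] - 2 * lam(adj, h, x, y)

# Hypothetical example: a ring of six vertices, so the graph is not a tree.
h = [1, 3, 2, 4, 2, 3]
adj = {i: [(i - 1) % 6, (i + 1) % 6] for i in range(6)}

pts = range(6)
lemma_2_13 = all(
    min(lam(adj, h, x, z), lam(adj, h, y, z)) <= lam(adj, h, x, y)
    for x in pts for y in pts for z in pts
)
triangle = all(
    d(adj, h, x, y) <= d(adj, h, x, z) + d(adj, h, y, z)
    for x in pts for y in pts for z in pts
)
print(lemma_2_13, triangle, d(adj, h, 2, 4))
```

In this example vertices 2 and 4 are distinct but at distance zero from one another, so the quotient step that follows is genuinely needed.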

We can now introduce the equivalence relation $x \sim y$ iff $d(x, y) = 0$ and the quotient 
space $I/\sim$. We write $I/\sim\ = \tilde{I}$ and $i : I \to \tilde{I}$ for the canonical projection. The 
function $d$ projects onto $\tilde{I} \times \tilde{I}$ and is a metric there. 

It is tempting to think that $x \sim y$ if and only if $C_x = C_y$ and this is true if $I$ is 
compact Hausdorff. However the definitions imply a slightly different criterion: $x \sim y$ 
iff 

$$h(x) = h(y) = \lambda \quad \text{and} \quad C_{x,\lambda''} = C_{y,\lambda''} \ \text{for all } \lambda'' < \lambda.$$
The stronger statement $x \sim y$ if and only if $C_x = C_y$ is not true for all continuous 
functions $h$ on $\mathbb{R}^2$, as it is easy to find a decreasing family of closed connected sets 
there whose limit is a closed set that is not connected. 

Consider again the new metric space $\tilde{I}$ that has as its points the equivalence 
classes of points indistinguishable under $d$. We now prove that the projection $i$ 
taking $I$ to $\tilde{I}$ is continuous. Fix $y \in I$ and $\varepsilon > 0$. Since $\lambda(y, \cdot)$ is lower 
semi-continuous and $h$ is (upper semi)continuous there is a neighbourhood $U$ of $y$ so that 
for $z \in U$ one has $\lambda(y, z) > \lambda(y, y) - \varepsilon/4$ and $h(z) < h(y) + \varepsilon/2$. Thus $d(y, z) < \varepsilon$ 
for $z \in U$. Hence $d(i(y), i(z)) < \varepsilon$ if $z \in U$. The function $i$ is continuous and as 
continuous images of compact sets are compact we have the following. 

Corollary 2.16. If $I$ is compact, then $\tilde{I}$ is a compact metric space. 
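
For completeness, the bound on $d(y, z)$ used in the continuity argument preceding Corollary 2.16 is a direct substitution of the two neighbourhood estimates into Definition 2.14, together with the identity $\lambda(y, y) = h(y)$:

```latex
\begin{align*}
d(y, z) &= h(y) + h(z) - 2\lambda(y, z) \\
        &< h(y) + \Bigl(h(y) + \tfrac{\varepsilon}{2}\Bigr)
           - 2\Bigl(\lambda(y, y) - \tfrac{\varepsilon}{4}\Bigr) \\
        &= 2h(y) - 2\lambda(y, y) + \varepsilon \\
        &= \varepsilon .
\end{align*}
```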

To complete this section we will show $\tilde{I}$ is a uniquely arcwise connected metric 
space, in which the arc between two points is isometric to an interval, and give a 
characterisation of compact trees. 

Proposition 2.17. If $I$ is a connected and locally connected topological space, and 
$h : I \to \mathbb{R}$ is a positive continuous function that attains its lower bound, then 
its "contour tree", the metric space $(\tilde{I}, d)$, is an R-tree. Every R-tree can be 
constructed in this way. 

Proof. It is enough to prove that the metric space $\tilde{I}$ we have constructed is really 
an R-tree and that every R-tree can be constructed in this way. Let $\tilde{x}$ be any point 
in $\tilde{I}$ and let $x \in I$ satisfy $i(x) = \tilde{x}$. Then $h(x)$ does not depend on the choice of 
$x$. Fix $h(v) \le \lambda \le h(x)$. We have seen that there is a $y$ such that $h(y) = \lambda$ and 
$y \preceq x$ (that is, $C_y \supseteq C_x$); moreover any two choices have the same contour through them and hence 
the same image $\tilde{y}(\lambda)$ in $\tilde{I}$. In this way we see that there is a map from $[h(v), h(x)]$ into $\tilde{I}$ 
that is injective. Moreover, it is immediate from the definition of $d$ that it is an 
isometry and that $\tilde{I}$ is uniquely arc connected. 

Suppose that $\Omega$ is an R-tree; then we may fix a base point $v$, and for each point 
in the tree consider the distance from $v$. It is clear that this continuous function is 
just appropriate to ensure that the contour tree is the original tree. □ 

Remark 2.18. 1. In the case where $I$ is compact, obviously $\tilde{I}$ is both complete and 
totally bounded as it is compact. 

2. An R-tree is a metric space; it is therefore possible to complete it. Indeed the 
completion consists of those paths, all of whose initial segments are in the tree 
(here we fix a root and identify each point of the tree with the geodesic arc from the 
root to that point); 
we have not identified a simple sufficient condition on the continuous function and 
topological space to ensure this. An R-tree is totally bounded if it is bounded 
and for each $\varepsilon > 0$ there is an $N$ so that for each $t$ the paths that extend a distance 
$t$ from the root have at most $N$ ancestral paths between them at time $t - \varepsilon$. In 
this way we see that the R-tree that comes out of studying the historical process 
for the Fleming-Viot or the Dawson-Watanabe measure-valued processes is, with 
probability one, a compact R-tree for each finite time. 

Lemma 2.19. Given a compact R-tree, there is always a height function on a 
closed interval that yields the same tree as its quotient. 

Proof. As the tree is compact, path connected and locally path connected, there is 
always a based loop mapping $[0, 1]$ onto the tree. Let $h$ denote the distance from 
the root. Its pullback onto the interval $[0, 1]$ is a height function and the natural 
quotient is the original tree. In this way we see that there is always a version of Le 
Gall's snake [3] traversing a compact tree. □ 